Measurement Uncertainties of Three Score Distributions and Two Thresholds with Data Dependency

نویسندگان

  • Jin Chu Wu
  • Alvin F. Martin
  • Craig S. Greenberg
  • Raghu N. Kacker
چکیده

The National Institute of Standards and Technology conducts an ongoing series of Speaker Recognition Evaluations (SRE). Recently a new paradigm was adopted to evaluate the performance of speaker recognition systems in which three distributions of target, known non-target, and unknown non-target scores, as well as two thresholds were employed. The new detection cost function was defined to be an average of the two weighted sums of the probabilities of type I and type II errors corresponding to the two decision thresholds. In addition, data dependency due to multiple use of the same subjects is also involved. The data were reorganized into a two-layer structure in view of the data dependency and the probability theory. Then, the uncertainties of the detection cost functions were computed using the nonparametric three-sample two-layer bootstrap method. Comparing these results with those calculated by using all the raw data and the nonparametric three-sample bootstrap method with the i.i.d. assumption, the measurement accuracies, i.e., the detection cost functions, have changed little; but the measurement uncertainties, i.e., the standard errors of the detection cost function, have improved as a result of taking account of the data dependency. Forty speaker recognition systems were used as examples. Index Terms – Metrology, measure, accuracy, uncertainty, bootstrap, data dependency, speaker recognition. This publication is available free of charge from http://dx.doi.org/10.6028/NIST.IR.8025

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Risk measurement in the global supply chain using monte-carlo simulation

Nowadays, logistics and supply chain management (SCM) is critical to compete in the current turbulent markets. In addition, in the global context, there are many uncertainties which affect on the market. One of the most important risks is supplier disruption. The first step to cope with these uncertainties is quantifying them. In this regard many researches have focused on the problem but measu...

متن کامل

Data Dependency on Measurement Uncertainties in Speaker Recognition Evaluation

The National Institute of Standards and Technology (NIST) conducts an ongoing series of Speaker Recognition Evaluations (SRE). Speaker detection performance is measured using a detection cost function defined as a weighted sum of the probabilities of type I error and of type II error. The sampling variability can result in measurement uncertainties. Thus, the uncertainties of the detection cost...

متن کامل

A two-level system model for score-based measurement

The normalized gain (or the g-factor) has been widely used in physics education community as an assessment measure for student performance. In particular, it allows researchers to compare different instructions using classes with different initial states. Systematic differences were identified by R. Hake with thousands of students, which show that classes, however different initially, tend to h...

متن کامل

A Single Machine Capacitated Production Planning Problem Under Uncertainty: A Grey Linear Programming Approach

The production planning is an important problem in most of manufacturing systems in practice. Unlike many researches existing in literature, this problem encounters with great uncertainties in parameters and input data. In this paper, a single machine capacitated production planning problem is considered and a linear programming formulation is presented. The production costs are assumed to be u...

متن کامل

Psychometric Properties of the Quantitative Myasthenia Gravis Score and the Myasthenia Gravis Composite Scale

BACKGROUND The Quantitative Myasthenia Gravis Score and the Myasthenia Gravis Composite are two commonly used outcome measures in Myasthenia Gravis. So far, their measurement properties have not been compared, so we aimed to study their psychometric properties using the Rasch model. METHODS 251 patients with stable myasthenia gravis were assessed with both scales, and 211 patients returned fo...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2014